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Construction of a cDNA Library from the Testis and Sequence 
Analysis of the Ubiquitin Gene from Rana nigromaculata 
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Abstract: A full-length cDNA library from the testis of dark-spotted frogs (Rana nigromaculata ) was constructed 
with the SMART (switching mechanism at 5’ end of RNA transcript) technique. Total RNA was extracted from the testis 
and reverse transcripted into full-length cDNA using PowerScript reverse transcriptase. The first-strand cDNA was ampli- 
fied using long-distance PCR (LD-PCR). After Sfi I digestion and fractionation, cDNA ( > 500 bp) was ligated to À 
TriplEx2 vector and packaged with Gigapack® |] Gold Packaging Extract. The titers of optimal primary libraries were 2.0 
x 10° pfu/mL and 2.4 x 10° pfu/mL and the titers of the amplified libraries were 0.48 x 10° pfu/mL and 3.0 x 10° pfu/ 
mL, respectively. The percentages of recombinant clones of primary libraries and amplified libraries were all over 90% . 
The libraries were converted into pTriplEx2 plasmids in Æ. coli BM 25.8 strain. The insert sizes were measured by 
PCR which showed most fragments were over 500 bp and the average length was 1.0 kb approximately. A positive 
clone of 1 171 bp was sequenced and named RnUb based on sequence similarity with the known ubiquitin genes in Gen- 
Bank. This sequence was a full-length cDNA with complete coding sequences, which indicated that the library built a base 
for screening the full-length cDNA. These data showed that this library attained to the requirements of a standard cDNA li- 
brary. This library provided a useful resource for the functional genomic research of Rana nigromaculata . 


Key words: Rana nigromaculata ; Total RNA; SMART; cDNA Library; Ubiqutin 


EMAA CDNA REWWERiZARAR INA 
Ae, RO, BE H, WAR, RHI E* 


(LEME AE ETRE BE, ECHR SETH 241000) 






































HE: OKA SMART (switching mechanism at 5’ end of RNA transcript) ERE T ERER ( Rana nigromaculata ) 
HRHERERK cDNA WE, HAE ER RE SALE, RNA, FA Powerscript™ J #4 a ME HE Se A RS HE cD- 
NA; HH LD-PCR ARM CDNA; AiE SA [| MEYA Chroma spin-400 #7} 4a, 500 bp UEA BES ATriplEx2 
BIKE, FLAN Gigapack® Ill Gold Packaging Extract GREHE, BURR UR CRE. DUR CRE EITI N 8 Bl 
PACE, ALR EGA CE ANT BE PBA 2.0 x 10° pfu/mL Fil 2.4 x 10° pfu/mL, PSR A CER EBA 0.48 x 
10° pfu/mL #1 3.0 10? pfu/mL, BZA RIJE 90% LAE. M E. coliBM25.8 APRS SCRE IOAN pTriplEx2 ME, 
Pkve— PAE SE MEVEAT PCR RW, BARA BORIS EAN 1.0 kbo HE — PAPE TEREST HMA 5" A 3 "ETT UY, 
EI—KA 1171 bp WW, BRAM, ARIE ACE SHE, FT aS 305 PAR, E&— ek cDNA 
FPS. HEA TREC EE AT WAFER cDNA WIE, ZARA, UPROAR oe RHR CDNA NEHA LSE in 
HIP A EE EERE o PASC PEI A EK De APES AI FA Me AS LVS FH SE ES) ee HE PL YR A: SE ae BE ET 
BED Fb, BERK Ay WEE HES h YP SH DE AMD AE FACET RATE BE OP FEAL 
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To date, sequencing and characterization of com- 
plementary DNA (cDNA), which represents a direct 
link to functional genomics, is a powerful means of i- 
dentifying genetic polymorphisms and is essential for 
determination of differential gene expression (Dias et 
al, 2000; Wiemann et al, 2003; Draper et al, 2002) , 
not only in humans but also in other species. However , 
full-length cDNAs of very few species can be obtained 
from the public database currently. So far, there are 
1 576 487 expressed sequence tags (ESTs) of Amphibia 
and 11 756 full-length cDNAs from Xenopus tropicalis 
and X. laevis, the model organisms of Amphibia, ob- 
tained from the National Center for Biotechnology Infor- 
mation (NCBI) website (http://www. ncbi. nlm. nih. 
gov/entrez/query. fegi and http://www. ncbi. nlm. 
Very limited full-length 
cDNAs from other species of amphibians were found in 


nih. gov/FLC/getmgc. cgi ). 


the public database. To identify more full-length genes 
of amphibians including characterization-specific ex- 
pressed, new or unknown genes and study their func- 
tions further, construction of cDNA libraries from other 
species of amphibians is an efficient method. 

The widespread dark-spotted frog, Rana nigro- 
maculata , is a representative of Ranidae, Anura, Am- 
phbia (Chen, 1991), and plays an important role in 
the ecological balance of nature. The amphibians repre- 
sent a bridge in the evolution of vertebrates from aquat- 
ic to terrestrial. They have highly specialized morpho- 
logical and functional characteristics to adapt to differ- 
ent environments (Fei, 1999). The sex determination 
and differentiation of amphibians were not as typical as 
other vertebrates. In mammals and birds, sex determi- 
nation and differentiation are chiefly controlled by ge- 
netic factors and many reptiles exhibits typical tempera- 
ture determination (Zhou et al, 2004). 


isn’t the same in amphibians. There is no difference in 


However, it 


sex chromosomes of many amphibians, even no sex 
chromosomes in some species (Zhou et al, 2004). Sex 
determination in the majority of amphibians is probably 


(Zhou et al, 


2004), however, in all amphibians, genetic sex deter- 


influenced by environmental factors 


mination (GSD) seems to operate (Schmid & Stein- 
lein, 2001). Genetic factors such as DMRT1 and 
DAX1 may be involved in the differentiation of testis 
(Shibata et al, 2001; Sugita et al, 2001). Despite the 
accumulated evidence that GSD is operating in Anura 
and Urodela, there is little substantial information 
about how it functions (Schmid & Steinlein, 2001) . 
On account of background mentioned above, in 
this paper we isolated total RNA of the testis from adult 


dark-spotted frogs to construct a full-length cDNA li- 
brary and expected that this library may present some 


new molecular materials for this species. 


1 Materials and Methods 


1.1 Animals and reagents 

Adult dark-spotted frogs ( Rana nigromaculata ) 
were obtained from Wuhu, Anhui, China. A total RNA 
Extraction kit (Trizol® Reagent) was purchased from 
Invitrogen Company. A cDNA Library construction kit 
(SMARTTM cDNA library Construction Kit) was ob- 
tained from Clontech Company. The Gigapack®l] Gold 
Packaging Extract was the product of Stratagene Com- 
pany. A 100 bp-ladder marker (from 100 to 2000 bp ) 
was purchased from Vitagene Company . 
1.2 Total RNA extraction 

Total RNA was isolated from the testis of two adult 
frogs according to the protocol of Trizol® Reagent and 
was dissolved in Milli-Q grade water. The purity of to- 
tal RNA was checked with measuring the absorbance of 
ultraviolet light at 260 nm (A260) and 280 nm (A209) . 
To examine the integrity and stability of total RNA, a 
5-uL sample and another 5-uL sample that was incubat- 
ed at 37°C for two hours before electrophoresis, were 
run on a denaturing formaldehyde agarose gel. 
1.3 cDNA library construction 
1.3.1 


mately 0.9 ug of RNA was reverse-transcripted into 


cDNA synthesis and amplification Approxi- 
first-strand cDNA by PowerScript!™ reverse transcrip- 
tase. Subsequently approximately 2 uL of first-strand 
cDNA sample was amplified using long-distance PCR 
(LD-PCR). The product was analyzed by running a 5- 
pL sample on a 1.1% agarose/EtBr gel alongside DNA 
size markers. 

1.3.2 Ligation of cDNA to ATriplEx2 vector and 
packaging The double-stranded cDNA (ds-cDNA ) 
was treated with protein K, and followed by Sfi | di- 
gestion and size fractionation. The first four peak frac- 
tions containing cDNA (>500 bp) were pooled togeth- 
er for packaging. 

Three parallel ligation reactions were performed 
using three different ratios of cDNA to the ATriplEx2 
vector (1:0.5, 1:1, 1:1.5) to ensure the optimal pos- 
sible library was obtained. Each of the three ligations 
was packaged with Gigapack [I] Gold Packaging Ex- 
tract. 

1.4 Titration of the primary library and deter- 
mining the percentage of recombinant clones 

The primary library was diluted by 1:5, 1:20, 
1:50 and 1:100. The number of clones was counted to 
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calculate the library titer according to the formula: pfu/ 
mL = number of plaques x dilution factor x 10° pL/mL 
(uL of diluted phage plated). The recombination effi- 
ciency was identified by blue/white screening in E. 
coli XL1-Blue. 
1.5 Library amplification and conversion of cD- 
NA library to plasmid 

The primary libraries were amplified on 40 plates 
of 12 cm diameter. The titer and recombinant efficiency 
were calculated using the same method as the above. A 
small quantity of R. nigromaculata cDNA library was 
converted into plasmid in Æ. coli BM 25.8 strain. 

One hundred clones were randomly picked to 
check the insert fragments by PCR using 5’ sequencing 
primer and 3’ sequencing primer. Sequencing analysis 
was completed with an ABI 3730 automatic DNA se- 
quencer. 
1.6 cDNA cloning of R. nigromaculata ubiquitin 

gene and comparison to other homologues 

One full-length cDNA of R. nigromaculata ubiq- 
uitin gene was gained using random sequencing analysis 
and was identified using the BLAST in the NCBI 
database in order to find homologues from other 
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Fig. 1 
Lane 1: a sample of 5-uL total RNA; Lane 2: a sample incubated at 
37 for 2h. 


Total RNA from testis of Rana nigromaculata 


2.2 Synthesis of cDNA 

The SMART technique was used to reverse tran- 
script RNA into cDNA, which was gained by LD-PCR 
with the adaptor primers. Incomplete cDNA and cDNA 
transcribed from poly A- RNA will lack the SMART an- 
chor and not be amplified. 

As shown in Fig. 2, the ds-DNA appeared as a 
smear of bands of 0.5 - 2 kb on the gel. The product 
of 25 cycles appeared as an intense low-molecular- 
weight ( < 500 bp) smear and subsequently the product 


species. All the deduced amino acid residues were 


aligned using Clustal-X (Thompson et al, 1997). 
2 Results and Analysis 
2.1 Total RNA extraction 


The key to construction of an excellent quality cD- 
NA library is to prepare good quality RNA first. The 
ratio of the readings at A26 and Avgo ( A260/A280) was 
approximately 1.82. The concentration of RNA was ap- 
proximately 0.9 pg/L according to the absorbance of 
ultraviolet light at 260 nm. As shown in Fig. 1, two 
bright bands of 18S rRNA and 28S rRNA can be seen 
clearly, indicating that the total RNA is integrated and 
stable enough for cDNA library construction. The sta- 
bility of RNA was verified by incubating a small sample 
at 37°C for two hours. There was little difference be- 
tween the incubated and the fresh samples, appeared in 
Fig. 1. If the RNA appeared to be unstable, the incu- 
bated sample will be weaker than the unincubated sam- 
ple. The total RNA isolated from the testis of frogs of 
R. nigromaculata was pure, integrated and stable for 
cDNA library construction. 





Fig. 2 The products of LD-PCR 
M: marker; Lane 1: the product of LD-PCR with 25 cycles; Lane 2: 
the product of LD-PCR with 22 cycles. 


of 22 cycles was chosen for next step. Fragments small- 
er than 500 bp were eliminated by cDNA fractionation 
to avoid that the library had a preponderance of very 
small inserts or/and apparently non-recombinant 
clones. 
2.3 Characterization of cDNA library 

As shown in Tab. 1, the titers of three primary li- 
braries constructed from three different ratios of cDNA 


to vector (1:0.5, 1:1, 1:1.5) were (1) 0.8 x 10° 
pfu/mL, (2) 2.0 x 106 pfu/mL, (3) 2.4 x 106 pfu/ 
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mL, which showed that the optimal ratio was 1:1.5. 
The capacity of the three libraries was as follows (1) 
0.4 x 10° clones, (2) 2.0x 10° clones, (3) 2.4 x 10° 
clones. The primary libraries with titers of over 1.0 x 
10° pfu/mL were both amplified. The titers of ampli- 
fied libraries were 0.48 x 10? pfu/mL and 3.0 x 10° 


pfu/mL, respectively. The recombination efficiencies 


of the primary and amplified libraries were all over 
90% , appeared in the Tab. 1. The insert ratio and the 
average length of inserted fragments were measured by 
PCR, as shown in Fig. 3. The average size was ap- 
proximately 1.0 kb and majority inserts were all over 
0.5 kb. 

2.4 Sequence analysis of the ubiquitin gene 


Tab. 1 Characterization of the primary and amplified cDNA libraries of R. nigromaculata 











Number of Ligation ratio Titer of Capacity of Recombination Titer of Capacity of Recombination 
primary of cDNA primary libraries primary libraries efficiency of amplified libraries amplified efficiency of 
libraries to vector (pfu/mL) (clones) primary (pfu/mL) libraries amplified 

libraries (clones) libraries 
(%) (%) 
(1) 1:0.5 0.8 x 10° 0.4 x 10° 97.4 - - - 
(2) 1:1 2.0 x 10° 1.0 x 10° 95.6 0.49 x 10° 3.2 x 10" 90.3 
(3) 1:1.5 2.4x 10° 1.2 x 10° 98.7 3.0x 10° 9.6x 10"! 92.7 
Capacity of library = Titer of library x Volume of library. 
M 1 16 M 
(bp) (bp) 
2 000 2 000 
1 500 1 500 
1 000 1 000 
500 500 





Fig. 3 Part of the inserted cDNA fragments randomly selected from the amplified cDNA library 


M: marker; Lane | - 16: 16 inserted cDNA fragments. 


A random positive clone was sequenced and a frag- 
ment of 1 171 base pairs (bp) was obtained (GenBank 
accession number DQ520795 in the NCBI database ) . 
The sequence information is shown in Fig. 4. The frag- 
ment has four initial codons of ATG, a termination 
codon TAA, a poly-A tail of 30 As and a signal se- 
quence of AATAAA for adding the poly-A tail. The 
open reading frame (ORF) of 918 bp (from the140th bp 
to the 1 058th bp) encodes 305 amino acids (aa). Four 
repeated units (r1 from the 140th to the 367th; r2 from 
368th to 595th; r3 from 596th to 823rd and r4 from 
824th to 1 058th) were shown in the sequence, of 
which the first three units are all 228 bp encoding 76 aa 
and the last unit consists of 234 bp encoding 77 aa. The 
sequence was a full-length cDNA, with a complete cod- 
ing sequence (CDS), indicating that the library con- 
structed in this paper provided a physical resource for 
full-length cDNA clones. 

The four nucleotide repeats in Fig. 4 show similar- 
ity with each other: rl and r2 show 91% nucleotide ho- 
mology; r3 has 35 different nucleotides from rl and 
they have 85% homology; r4 and rl are 77% homolo- 
gous differing at 43 sites. The nucleotide changes a- 


mong the four repeats are at the second or/and third nu- 
cleotide position that are silent variations, but the base 
‘T? at position 650 of r3 is different from the base ‘C?’ 
in other repeats results in the amino acid change at po- 
sition 19 (Ser not Pro in repeat 3). It is unclear if this 
is due to experimental error or if it is a specialized 
change in this species. 

As shown in Fig. 5, the RnUb protein has similar 
characteristics and shows at least 96% similarity with 
polyubiqutin of other species, which indicates that the 
ubiquitin gene is conserved through the process of evo- 
lution. Except the last aa, the predicted amino acid se- 
quences of ubiquitin of the dark-spotted frog, human, 
mouse, the western clawed frog and the salmon trout 
only differ at the 171% aa (that is the 19™ aa in repeat 3 
shown in Fig. 4), where a serine exists in RnUb while 
a proline exists in the other four species. This may infer 
that the change of nucleotide at 650" position in RnUb 
is due to experimental error. We assume that there 
would be strong selective pressure in the evolution of 
the ubiquitin gene, as most substitutions are silent vari- 
ations at the nucleotide level and few are amino acid re- 
placement substitutions . 











1# ABA 





EESE: RREN RHR CDNA NEWI EIZ KIEF I AYP IT BENT ) 























13 








5’ UTR 1 GCAGGACAAGGAAGGTATC 19 
20 CCTCCAGATCAGCAGAGGCTGATTTTTGCTGGCAAGCAGCTAGAAGATGGCCGCACCCTC 79 
80 TCTGACTACAATATCCAGAAGGAATCCACTCTGCACTTGGTGTTGCGTCTGAGGGGTGGG 139 
IMQIFVKTLTGKTITLE VE PS 20 
(r1) 140 [ATGCAAATCTTTGTCAAGACCCTAACTGGCAAAACTATTACTTTGGAAGTTGAGCCAAGT 199 
(T2) 368. ------------------------------------------------------ CCA--- 427 
(3) 596 ==- T----A--A--G--C-—-----—--—— G--G---TCC--- 655 
(r4) 824 -------- A-----G-----= T-----—- T-----C-----— (-A-------- ACCA--C 883 
2.DTIEBNVYK AK IQDKEGIP P D Q40 
(r1) 200 GACACAATTGAGAACGTCAAAGCTAAAATCCAGGACAAGGAAGGTATCCCTCCAGATCAG 259 
(r2) 428 -------------- T-——----—--———- T------—- A- T--------———- 487 
(r3) 656 ~------~------ T-------------- T-------- A-——--——- T---------——- 715 
(r4) 884 ———--T-----A--T--6--6--C-—---T--A--T--A-----C--T----—-----—-- 943 


41Q R L I F AG K QL EODGRTLS D Y N 60 
(r1) 260 CAGAGGCTGATTTTTGCTGGGAAACAGCTAGAAGATGGCCGCACCCTCTCTGACTACAAT 319 











(12) 488 ---------- manana (nen ~----=T---=---- T--A-------- C 547 
(r3) 716 ~-A--A----------- A--(----------------- T-------- T--A-------- C 775 
(r4) 944 --A-----T--C-----A-~A---~----G-----------T--TT-G--A--T--T-€._ 1003 
6G IQKESTLHLYVYLRLRGG F stop 
(r1) 320 ATCCAGAAGGAATCCACTCTGCACTTGGTGTTGCGTCTCAGGGGTGGG 367 
(12) $48 -----A------~----~ T-A--TC-T--CC-C--A---—A-- 595 
(3) 176 -—------A-----T--C--C--TC-T--TC-C-----A--A--A--A 823 
4) 1004 --T-----A—----T-----—---(-T--(C-----C----------- TIT Tf ad) 1057 


3’ UTR 1058 GATGACAATCAGCAGTACTGCTTTTCCTTGCATTTAAGTTTATGCTTCTCTTGCACTTTT 1117 
1118 GTTTCAATAAAGTAATTGCATTCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 


Fig. 4 Full-length cDNA sequence and deduced amino acid sequence of ubiqutin 


gene of R. nigromaculata 


5’ UTR: 5’ untranslated region of 139 bp; 3’ UTR: 3’ untranslated region of 84 bp; rl, r2, r3 and r4 indicate 


the four repeats; ‘-’: the short lines means the same nucleotide as the first rl; ATG: initiation codon; TAA: ter- 


mination codon; AATAAA indicates a polyadenylation signal sequence; P/S means ‘P?’ or ‘S? at position 19 in 
the aa sequence; The aa of the 19th position are encoded by CCA of rl, r2 and r4 or by TCA of r3; CCA encodes 


the P (Pro) while TCA encodes S (Ser). 


3 Discussion 


Conventionally-generated cDNA libraries contain a 
high percentage of 5’-truncated clones ( Wellenreuther 
et al, 2004). Fortunately, the SMART technique can 
overcome it. There are four advantages of using this 
method to construct full-length libraries at least: First, 
the terminal transferase activity of PowerScript reverse 
transcriptase adds a few additional nucleotides, primar- 
ily deoxycytidine , to the 3’ end of the cDNA, when the 
RT reaches the 5’ end of the mRNA. The SMART [V 
Oligo, with an oligo (G) at its 3’ end, base-pairs with 
the deoxycytidine stretch, creating an extended tem- 
plate. Second, cDNA synthesis employs long-distance 
PCR for generating full-length cDNA with two adaptor 
primers. Incomplete cDNAs and cDNA transcribed 
from poly A- RNA will lack the SMART anchor and not 


be amplified (Chenchik et al, 1998). Third, frag- 
ments smaller than 500 bp were eliminated by cDNA 
fractionation. Fourth, asymmetrical Sfi | (A & B) re- 
striction enzyme sites at the adaptor primers at the 5’ 
and 3’ cDNA ends are extremely rare in mammalian 
DNA; therefore, all the cDNAs remain intact after Sfi 
I digestion Methylation steps are eliminated; valuable 
internal restriction sites are preserved. The vector 
ATriplEx2 containing the asymmetrical Sfi ] sites in 
the multiple cloning site (MCS), eliminates adaptor 
ligation and facilitates directional cloning. Actually the 
SMART method has become one of the conventional 
methods to obtain the full-length cDNA (Chenchik et 
al, 1998). 

There are three chief aspects that identify the 
quality of a cDNA library. One is the capacity of the 
library. The capacity should be more than 1.7 x 10° 
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. 5 Similarity of putative amino acid sequences of RnUb with the ubiqutin 


gene from nine other species 


The asterisk indicates identical aa residues, and ‘.’ or ‘: indicate strongly positive or weakly positive residues. 


The ‘S? is the only amino acid by which the frog, human, mouse and western clawed frog ubiquitin gene differed . 
The four repeat units were marked with (R1), (R2), (R3) and (R4). The following eight species have been 


used in this figure: 


(AAH21837), Ovis aries (AAB92373); a bird, Gallus gallus (XM - 415847); one amphibian, Xenopus tropi- 


calis (AAH74652); a fish, Oncorhynchus mykiss (AAK51460) ; one insect, Bombyx mori (BAA76676) ; and a 


three mammals, Homo sapiens (accession number AAHO08955), Mus musculus 


yeast Kluyveromyces lactis (CAB50898) . 
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clones to ensure that the low-abundance mRNA would 
be present in the library (Sambrook & Russell, 2001; 
Li et al, 1998). The primary library had no less than 
0.4 x 10° clones, which in principle was sufficient for 
inclusion of most rare mRNA. The high recombination 
efficiency is another index of good quality library (Li et 
al, 2006). In this paper, the recombination efficiency 
of primary and amplified libraries was all over 90%. 
The third aspect is that the average length of inserted 
cDNA should be no less than 1.0 kb to ensure the in- 
tegrity of cDNA (Jin et al, 2004; Tanaka et al, 1996; 
Yang et al, 2004). The average size of inserted frag- 
ments in this study was 1.0 kb approximately. The 
full-length cDNA library constructed from the dark- 
spotted frog conformed to the requirements of a stan- 
dard library. This library provided a useful resource for 
the functional genomic research of R. nigromaculata . 
cDNA libraries are widely used to identify genes 
and splice variants, and as a physical resource for full- 
length clones (Wiemann et al, 2003). The regions en- 
coding proteins of full-length cDNA are usually con- 
served, and may have some similarities with closely re- 
lated regions, which can help us to identify and ana- 
lyze the protein encoding genes (Gracey et al, 2001). 
For example, the RnUb obtained in the library as a 
full-length cDNA showed similar characteristics with u- 
biquitin of other species. In the previous reports, ubiq- 
uitin, a 76-amino acid protein, is one of the most con- 
served proteins known in eukaryotic cells, with only 
three amino acid changes from yeast to humans and it 
can be covalently attached to a variety of cellular pro- 
teins and plays a pivotal role in many aspects of eu- 


karyota (Amerik & Hochstrasser, 2004; Deng, 2000; 
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